S2OSC: A Holistic Semi-Supervised Approach for Open Set Classification

نویسندگان

چکیده

Open set classification (OSC) tackles the problem of determining whether data are in-class or out-of-class during inference, when only provided with a examples at training time. Traditional OSC methods usually train discriminative generative models owned data, and then utilize pre-trained to classify test directly. However, these always suffer from embedding confusion problem, i.e., partial instances mixed ones similar semantics, making it difficult classify. To solve this we unify semi-supervised learning develop novel algorithm, S2OSC, which incorporates filtering model re-training in transductive manner. In detail, given pool newly coming S2OSC firstly filters mostly distinct using model, annotates super-class for them. Then, trains holistic by combing labeled remaining unlabeled paradigm. Furthermore, considering that streaming form real applications, extend into an incremental update framework (I-S2OSC), adopt knowledge memory regularization mitigate catastrophic forgetting update. Despite simplicity proposed models, experimental results show achieves state-of-the-art performance across variety tasks, including 85.4% F1 on CIFAR-10 300 pseudo-labels. We also demonstrate how can be expanded setting effectively data.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A semi-supervised approach to question classification

This paper presents a machine learning approach to question classification. We have defined a kernel function based on latent semantic information acquired from unlabeled data. This kernel allows including external semantic knowledge into the supervised learning process. We have combined this knowledge with a bag-of-words approach by means of composite kernels to obtain state-of-the-art results...

متن کامل

Music Genre Classification: A Semi-supervised Approach

Music genres can be seen as categorical descriptions used to classify music basing on various characteristics such as instrumentation, pitch, rhythmic structure, and harmonic contents. Automatic music genre classification is important for music retrieval in large music collections on the web. We build a classifier that learns from very few labeled examples plus a large quantity of unlabeled dat...

متن کامل

A Direct Boosting Approach for Semi-supervised Classification

We introduce a semi-supervised boosting approach (SSDBoost), which directly minimizes the classification errors and maximizes the margins on both labeled and unlabeled samples, without resorting to any upper bounds or approximations. A twostep algorithm based on coordinate descent/ascent is proposed to implement SSDBoost. Experiments on a number of UCI datasets and synthetic data show that SSDB...

متن کامل

A semi-supervised cluster-and-label approach for utterance classification

In this paper, we propose a semi-supervised cluster-and-label algorithm for utterance classification. The approach assumes that the underlying class distribution is roughly captured through– fully unsupervised–clustering. Then, a minimum number of labeled examples is used to automatically label the extracted clusters so that the initial label set is ”augmented” to the whole clustered data. The ...

متن کامل

A New Approach for Semi-supervised Online News Classification

Due to the dramatic increasing of information on the Web, text categorization becomes a useful tool to organize the information. Traditional text categorization problem uses a training set from online sources with pre-defined class labels for text documents. Typically a large amount of online training news should be provided in order to learn a satisfactory categorization scheme. We investigate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Knowledge Discovery From Data

سال: 2021

ISSN: ['1556-472X', '1556-4681']

DOI: https://doi.org/10.1145/3468675